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Abstract 

We significantly improve known time bounds for solving the minimum cut problem on undirected 
graphs. We use a "semi-duality" between minimum cuts and maximum spanning tree packings combined 
with our previously developed random sampling techniques. We give a randomized algorithm that finds 
a minimum cut in an m-edge, n-vertex graph with high probability in 0(m log^ n) time. We also give 
a simpler randomized algorithm that finds all minimum cuts with high probability in 0{n^ logn) time. 
This variant has an optimal TlNC parallelization. Both variants improve on the previous best time bound 
of O(n^log^n). Other applications of the tree-packing approach are new, nearly tight bounds on the 
number of near minimum cuts a graph may have and a new data structure for representing them in a 
space-efficient manner. 

1 Introduction 

The minimum cut problem has been studied for many years as a fundamental graph optimization problem 
with numerous appHcations. Initially, the problem was considered a harder variant of the s-t minimum cut 
problem and was solved in 0{mn^) time {0{f) denotes 0(/polylog/)) on m-edge, n-vertex graphs [FF62, 
GH61]. Improvements then followed: first to 0{mn) time [H094, NI92a], showing that minimum cuts were 
as easy to find as maximum flows; then to O(ri^) time [KS96], showing they were significantly easier. 

In this paper we give an algorithm with an Oim log"^ n) running time. Initially perceived as a harder 
variant of the maximum flow problem, the minimum cut problem turns out to be solvable in near-linear 
time. Side effects of our analysis include new combinatorial theorems on the structure, enumeration and 
representation of minimum cuts. We also give a relatively simple algorithm that runs in linear time on a large 
class of graphs and runs in 0{tt? logn) time on all graphs. This algorithm still dominates all previous ones 
and can also be parallelized optimally (that is, with time- processor product no worse than the sequential 
algorithm's) to yield the best-known TZMC algorithm for the problem. 

Our algorithm is based on a "semi-duality" between minimum cuts and undirected spanning tree packings. 
It is therefore fundamentally different from past approaches based on flows or on edge contraction. An 
algorithm developed by Gabow [Gab95] uses directed tree packing to solve the problem, but ours appears 
to be the flrst to use undirected spanning tree packings. It thus introduces a different approach to finding 
minimum cuts in undirected graphs. 



1.1 The Problem 

This paper studies the minimum cut problem. Given a graph with n vertices and m (possibly weighted) 
edges, we wish to partition the vertices into two non-empty sets so as to minimize the number (or total 
weight) of edges crossing between them. More formally, a cut (A, B) of a graph G is a partition of the 
vertices of G into two nonempty sets A and B. An edge (w,w) crosses cut {A,B) if one of v and w is in A 
and the other in B. The value of a cut is the number of edges that cross the cut or, in a weighted graph, the 
sum of the weights of the edges that cross the cut. The minimum cut problem is to find a cut of minimum 
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value. Wc use c to denote the value of this minimum cut. Throughout this paper, the graph is assumed to 
be connected, since otherwise the problem is trivial. We also require that the edge weights be non-negative, 
because otherwise the problem is A/'P-complete by a transformation from the maximum-cut problem [GJ79, 
page 210]. We distinguish the minimum cut problem from the s-t minim,um cut problem in which we require 
that two specified vertices s and t be on opposite sides of the cut; in the minimum cut problem there is no 
such restriction. 

1.2 Applications 

The minimum cut problem has several applications. Picard and Queyranne [PQ82] survey applications in- 
cluding graph partitioning problems, the study of project networks, and partitioning items in a database. In 
information retrieval, minimum cuts have been used to identify clusters of topically related documents in hy- 
pertext systems [Bot93] . The problem of determining the connectivity of a network arises frequently in issues 
of network design and network reliability (one of our previous papers [Kar97b] exploits an extremely tight 
connection between minimum cuts and network reliability). Minimum cut computations are used to find the 
suhtour elimination constraints that are needed in the implementation of cutting plane algorithms for solving 
the traveling salesman problem [DFJ54, LLKS85]. Padberg and Rinaldi [PR90] and Applegate [App92] have 
reported that solving min-cut problems was the computational bottleneck in their state-of-the-art cutting- 
plane based TSP algorithm, as well as other cutting-plane based algorithms for combinatorial problems 
whose solutions induce connected graphs. 

1.3 Past Work 

The minimum cut problem was originally approached as a variant of the s-t minimum cut problem and 
solved using flow techniques [FF62]. The most obvious method is to compute s-t min-cuts for all vertices 
s and t; this requires (2) flow computations. Gomory and Hu [GH61] showed that the problem could be 
solved by n flow computations; using present flow algorithms [GT88] this gives a time bound of 0{mnP). 
After a lengthy period of little progress, several new algorithms appeared. Hao and Orlin [H094] showed 
how the n flow computations could be performed in the time for one, improving the running time to 0{mn). 
Gabow [Gab95] gave an alternative "augmenting- tree" scheme that required 0{mc) time on an unweighted 
graph with minimum cut c. 

Sampling as an attack on minimum cuts was introduced in [Kar97a] . We showed that sampling could be 
used to find approximate minimum cuts in 0{m) time and to find them exactly in 0{m^/c) time (combining 
sampling with Gabow's algorithm). The sampling technique is crucial to this paper as well. 

The above approaches all use maximum flow techniques that treat undirected graphs as directed graphs 
(Gabow's algorithm packs directed trees rather than directed paths, but still addresses directed graphs). 
Recently a flowless undirected-graph approach based on edge contraction was discovered. If we can identify 
an edge that does not cross the minimum cut, then we can merge its endpoints into a single new vertex 
without affecting the minimum cut. Performing n — 2 valid contractions will reduce the graph to two 
vertices, at which point there is only one nontrivial cut that must therefore correspond to the minimum cut. 
Nagamochi and Ibaraki [NI92b, NI92a] devised a sparse certificate computation that found a contractible 
edge in 0{m) time; this led to an 0(mn)-time minimum cut algorithm. Karger and Stein [KS96] showed 
that random edge contraction worked well, leading to an algorithm running in 0(n^ log'^ n) time. 

1.4 Our Results 

In this paper wc present a different approach to minimum cuts that yields faster algorithms than were 
previously known. We use a "semi-duality" between minimum cuts and maximum packings of undirected 
spanning trees — arguably a more natural dual than the directed flows or directed spanning trees used previ- 
ously [GH61, H094, Gab95]. Our approach does not rely fundamentally on flows or edge contractions. For 
a weighted, undirected graph we give: 

• A randomized algorithm that finds a minimum cut with constant probability in 0{m\o^ n) time and 
with high probability in O(mlog'^n) time. This significantly improves the previous best 0(r7,^log^n) 
bound of [KS96]. A technical refinement improves the time bound by an additional log log n factor. 
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• A variant that finds a minimum cut with constant probabihty in O(n^) time and finds all (possibly 
f2(n^)) minimum cuts with high probabihty in 0(n^ log n) time. While this only gives a speedup of 
0(log^n) over the algorithm of [KS96], the new algorithm is quite simple, relying only on computing 
minimum spanning trees, finding least common ancestors and evaluating expression- trees. It runs in 

0{m) time on a natural class of graphs. 

• A parallel version of our simpler algorithm that runs in 0(log'^ n) time using n^/ log^ n processors. 

• New bounds of 8(nL^"J ) on the number of cuts with value at most a times the minimum that a graph 
can have. For non-half-integral a this improves on a previous best general bound of 0(n^") [Kar93a]. 
This bound is the first to match the "step function" behavior of the known lower bounds with respect 
to a. 

• A new data structure that represents all minimum and near- minimum cuts. Besides being smaller than 
previous representations [KS96], this representation highlights certain structural features of a graph's 
near-minimum cuts. 

1.5 Our Methods 

We now summarize the approach of the paper. At its core is the following definition: 

Definition 1.1. Let T be a spanning tree of G. We say that a cut in G k-respects T if it cuts at most k 

edges of T. We also say that T k-constrains the cut in G. 

Nash- Williams [NW61] proved that any any graph with minimum cut c contains a set of c/2 edge-disjoint 
spanning trees. These trees must divide up the c minimum cut edges. It follows that any such tree packing 
contains a tree that 2-constrains the minimum cut. Using this idea, we show that for any graph G, we can 
find (by packing trees) a small set of spanning trees such that every minimum cut 2-respects some of the 
spanning trees. This lets us reduce to the following problem: find the minimum cut in G that 2-respects 
a given spanning tree. While this might appear a harder problem than the original, it turns out that the 
added constraints make the problem easier. Our scheme can be thought of as a variant of the "branch and 
bound" techniques used for hard optimization problems: we consider several families of added constraints, 
one of which is satisfied by the optimum solution. We can find the optimum by solving each constrained 
optimization problem. 

To introduce our approach, we show in Section 2 that a maximum packing of spanning trees always 
contains many trees that 2-constrain any minimum cut. An immediate application of this observation in 
Section 3 yields the tightest known bounds on the number of near-minimum cuts in a graph. 

Next we turn to the problem of actually finding the minimum cut. It follows from the previous para- 
graph that we can find the minimum cut by examining each tree of a maximum packing and finding the 
smallest cut that 2-respects each tree. Although finding a maximum tree packing is hard, we show in 
Section 4 that Gabow's minimum cut algorithm [Gab95] or the Plotkin-Shmoys-Tardos fractional packing 
algorithm [PST91] can be used to find a packing that is sufficient for our purposes. Unfortunately, in a 
graph with minimum cut c, such a packing contains roughly c trees, so checking all of them is prohibitively 
slow. Indeed, even finding the packing (with either algorithm) takes time proportional to c. To eliminate 
this factor, we use random sampling techniques that we developed previously [Kar97a, Kar94] to reduce the 
packing time to 0(m -|- nlog^ n) and the number of trees in the packing to O(logn). 

Once we have the packing, it remains to examine each tree in it. We begin in Section 5 by showing how 
to find the smallest cut that 1-respects a given tree in 0{m) time. Our algorithm involves a simple dynamic 
program that determines the value of the cut "induced" by removing each edge of the given tree. Besides 
introducing our techniques, this algorithm can be used to find the minimum cut in 0(m -|- nlog"* n) time on 
a natural family of "fat" graphs containing many spanning trees. 

In some cases we will not be able to find a tree that 1-constrains the minimum cut, so we move on to 
finding the minimum cut 2-respecting a given tree. We present two solutions. In Section 6 we extend the 
simple dynamic program of Section 5 to compute the value of the cut induced by removing each pair of tree 
edges. The algorithm runs in O(n^) time — optimal for an algorithm that computes the values for all (2) 
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pairs. By running on O(log n) trees it identifies all niinirnuni cnits with high probabihty in 0{n^ logn) time. 
In Section 6.2, we also show how it can be parallelized optimally. 

The second algorithm, presented in Section 7, implicitly considers all (") pairs of tree edges without 
taking ri(n^) time to do so. We show how dynamic tree data structures [ST83] can be used to evaluate a 
particular tree in 0(m log^ n) time. Thus evaluating the necessary O(logn) trees takes 0(m log^ n) time and 
gives the claimed bound. 

We finish by giving some extensions of the algorithm: a fast and simple algorithm for a large class of 
"fat" graphs, and some refinements that improve the running time of our complicated algorithm by some 
small factors. 

The initial tree-packing step is the only one using randomization; all subsequent operations in the al- 
gorithm are deterministic. The fact that random sampling is used means that our algorithms, like those 
of [KS96], are Monte Carlo algorithms. Although the probability of success is high, there is no fast way to 
certify the correctness of the final answer (indeed, the fastest known method is to use one of the deterministic 
minimum cut algorithms). 

2 Cuts and Tree Packings 

In this section, we give some background on tree packings and their connection to cut problems. For the 
moment, let us restrict attention to unweighted graphs. An unweighted tree packing is a set of edge-disjoint 
spanning trees in a graph. The value of the packing is the number of trees in it. Clearly, every spanning tree 
in an unweighted packing must use at least one edge of every cut. Thus, at most c trees can be packed in a 
graph with minimum cut c. Nash- Williams [NW61] gave a related lower bound: 

Theorem 2.1 ([NW61]). Any undirected graph with minimum cut c contains a tree packing of value at 
least c/2. 

Nash- Williams actually proved a tighter result: if Cr is the value of the minimum r-way cut in the graph, 
then the value of the maximum tree packing is exactly [minrCr/ir — 1)J. But observe that the minimum 
r-way cut must have Cr > rc/2, since each of the r components of the cut must have c edges leaving it. Thus 

c rc 

min — ^ > min — — > c/2 

r-1 - 2{r-l) - ' 

Theorem 2.1 is not universally tight — for example, a single tree has minimum cut 1 and contains one spanning 
tree. However, it is existentially tight since, for example, a cycle has minimum cut 2 but has maximum tree 
packing value 1. We will find later that the tightness of a graph with respect to tree packings is a factor 
that determines the hardness of finding its minimum cuts using our algorithm. The factor-of-2 gap is 
particular to undirected graphs: Edmonds [Edm72] proved that a directed graph with directed minimum cut 
c has a packing of exactly c edge-disjoint directed spanning trees (Gabow exploits this in his minimum cut 
algorithm [Gab95]). 

Since any graph has a packing with c/2 trees, it follows that a maximum packing will contain at least c/2 
trees. Consider any minimum cut. Since the edges of the minimum cut must be partitioned among the trees 
of the maximum packing, the average number of min-cut edges per tree is at most c/(c/2) = 2. It follows 
that at least one of the trees in such a packing has at most 2 minimum-cut edges. In other words at least 
one tree 2-constrains the minimum cut. 

We will strengthen this argument in Lemma 2.3. First let us see why it is useful. If we are given a 
tree, and are able to determine which of its edges cross the minimum cut, then in fact we will have found 
the minimum cut. Consider a tree in which all minimum cut edges have been marked. Start at any vertex 
and traverse the tree path to any other vertex. Each time we cross a min-cut edge, we know that we have 
switched sides of the minimum cut. It follows that two vertices are on the same side of the minimum cut if 
and only if the number of minimum cut edges on the path between them is even. More concretely, suppose 
a tree has only one minimum cut edge. Then removing this edge separates the tree into two subtrees, each 
of which is one side of the minimum cut. If a tree contains two minimum cut edges, then removing them 
separates the tree into three pieces. The "central" piece that is adjacent to both other pieces forms one side 
of the minimum cut, while the two remaining pieces form the other side. 
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The above paragraph demonstrates that any snbset of the tree edges defines a unique two-way cut of 
the graph G. It is also clear that every cut defines a unique set of tree edges — namely, those crossing the 
cut — and that this correspondence between cuts and sets of tree edges is a bijection. 

2.1 Weighted Packings 

We now extend the above discussion to argue that many of the trees in a maximum packing must 2-respect 
that minimum cut. This is useful since it means that a randomly selected tree is likely to 2-constrain the 
minimum cut. We also generalize the tree packing definition to allow for weighted trees packed into weighted 
graphs. 

Definition 2.2. A (weighted) tree packing is a set of spanning trees, each with an assigned weight, such 
that the total weight of trees containing a given edge is no greater than the weight of that edge. The value 
of the packing is the total weight of the trees in it. 

If all edge and tree weights are integers, we can treat a weight-fw edge as a set of w parallel edges, and a 
weight w tree as a set of w identical trees. This recovers the unweighted notion of a tree packing as a set of 
edge-disjoint trees. The value of the packing becomes the number of (unweighted) trees. The definition also 
allows for fractional weights. This can be ignored, though, since we can always multiply all edge and tree 
weights by the least common denominator of the fractions to return to the integral version. It follows that 
Nash- Williams' Theorem 2.1 applies unchanged to weighted tree packings. 

Lemma 2.3. Given any weighted tree packing of value (3c and any cut of value ac, at least a \{2> — a/ (3) 
fraction of the trees (by weight) 2-constrain the cut. 

Corollary 2.4. In any maximum packing, half the trees (by weight) 2-constrain the minimum cut. 

Proof. Suppose that tree T has weight wt (so wt = 13c) and that we choose a tree T at random with 
probability proportional to its weight. Define the random variable Xt to be one less than the number of 
edges of T crossing the a-minimum cut. Note that is always a nonnegative integer. Since we have a 
packing, we know that no a-minimum cut edges arc shared between the trees. It follows that 

'^^wt{xt -|- 1) < ac 

^^wtsjt < oic 
< ac 

and thus 




E[xt\ = WtXt 

< {alf3 - 1) 

It follows from Markov's inequality that xt < 2 with probability at least 1 — l/2[a/ (3 — 1) = ^(3 — a/ (3). 
Since Xt is an integer, Xt <2 implies that < 1, meaning that the a-minimum cut 2-respects T. 

The corollary follows immediately by taking a = 1 and observing the /3 > 1/2 by Theorem 2.1. □ 



3 Combinatorics 

As a first application of the concept of respecting constraint trees, we tighten the bounds on the number 
of small cuts in a graph. This section can be skipped without impacting the remainder of the paper. Our 
discussion is limited to two-way cuts — that is, sets set of edges crossing a two-way vertex partition. In 
contrast, the combinatorial results of [KS96] also apply to multiway cuts. 

Definition 3.1. A cut in G is a-minimum if its value is at most a times the minimum cut value. 
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In [Kar93a, Kar97a], wc proved that the mimbcr of Q-minimum cuts in a graph is 0(«,^"). Others have 
since tightened this bound for small a: Benczur [Ben95] bounded the number of 6/5-minimum cuts by O(n^), 
Nagamochi, Nishimura and Ibaraki [NNI94] gave a bound of O(ri^) for the number of 4/3-minimum cuts, 
and subsequently Henzinger and Williamson [HW96] showed that the number of cuts strictly less than 3/2 
times the minimum is 0(n-^). In an unweighted n- vertex cycle, any even-size set of up to [2aJ edges is the 
edge set of an a-minimum cut. Therefore, the number of a-minimum cuts can be as large as 

n\ fn\ _ _ f n \ L2aJ 



We give an upper bound that is very close to this lower bound, matching its "step function" behavior as a 

function of a. 

Our bound uses tree packings. It simplifies the proof to restrict attention to unweighted tree packings — 
collections of edge-disjoint spanning trees — in unweighted graphs. The discussion of the previous section 
showed that this loses no generality. We begin with a weak but easily proved lemma: 

Lemma 3.2. For any constant a, there are 0(n'-^'*J) a-minimum cuts. 

Proof. Let k = [2a\. Number the a-minimum cuts from 1 to i. Our goal is to bound i by 0{n''). 

Recall Nash- Williams' Theorem 2.1 [NW61] which says that any graph with minimum cut c contains at 
least c/2 edge-disjoint spanning trees. Suppose that we pick one of these trees at random. Let j/j = 1 if we 
pick a tree that A;-constrains the i*^ a-minimum cut and otherwise. Since this cut contains at most ac edges 
and each edge is in at most one of our c/2 spanning trees, the expected number of cut edges in our chosen 
spanning tree is at most 2a. It follows from Markov's inequality [MR95] and the fact that 1 -|- [2aJ > 2a 
that the chosen tree has at most k = [2aJ cut edges with constant probability. That is, t/, = 1 with constant 
probability, meaning E[yi\ = f2(l). Thus 



On the other hand, as discussed in the previous section, each cut that fc-respects a given tree is in 1-1 
correspondence with the set of at most k edges of the tree that it cuts. There are 0{n^) such sets of edges, 
so no tree can fc-constrain more than this many cuts. Thus 

E^Vi] < max-Y^Vi = 0{n''). 

Combined with the previous bound, this means £ = 0{n'') as claimed. □ 

We now give a stronger result with a more complicated proof. 
Theorem 3.3. The number of a-minimum cuts is at most 

^ ^ " \l + 0(l/n)). 



[2aJ -I- 1 - 2a VL2a'J 



Remark. For a a half integer, this quantity is within 1 -|- o(l) of the lower bound in Equation 1. The bound 
remains tight to within a constant factor for all values of a except those infinitesimally less than an integer 
(e.g., a = 3 — l/^^)• For such values, we get a better boimd by considering cuts of value at most [2a] /2; 
since this is a half-integer the theorem gives a bound of {^2a])(^ + 

Proof. As before, we assume G is unweighted and pack c/2 edge-disjoint trees in the graph. Consider a 
bipartite graph B, with one side consisting of the trees in the Nash- Williams packing, and the other of the 
a-minimum cuts. To avoid confusion, we refer to the vertices of this bipartite graph as nodes and to its 
edges as arcs. Draw an arc from a tree-node T to a cut-node C if the tree /c-constrains the cut in G. We 
will now assign weights to these arcs so that: 

1. every tree node in B has weighted arc degree at most some D; 
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2. every cut node in B has weighted arc degree at least some d 

Since there are c/2 trees in the packing, we see from (1) that the total weight of arcs in B is at most cD/2. 
It follows from (2) that the number of cuts in B is at most cD/2d. We show that this quantity can be made 
small by an appropriate choice of arc weights. 

We now define the weight assignment. Let k = [2aJ . For a given arc (T, C), if tree T r-constrains cut C 
for some r <k but does not (r + l)-constrain C, then we give the arc weight 1 + k — r. If more than k edges 
of T cross the cut C, we give the arc weight 0. Since a tree can exactly r-constrain at most (") distinct cuts, 
if follows that the tree-side degree 

E (")(l + ^-r) = (")(l + 0(l/n)). 

Now consider the cut-side degree d. Take any cut C. For tree T, let rr denote the number of edges of T 
that cross C. Then the weighted degree of C can be written as 

Y.{l + k-rr)>Y.^l + k-rT) 

rT<k T 

= {l + k){c/2)-J2rT 

T 

> {1 + k){c/2) - ac 
= {l + k-2a){c/2), 

where the first inequality follows from the fact that all the terms being added to the summation on the right 
hand side are negative, and the second from the fact that J^t — ^i^ce each of the ac edges of the cut 
contribute to Tt for at most one T. This is a lower bound on the weight d incident on any cut-side node in 
B. Since we have shown above that D < (^!)(1 + 0{l/n)), it follows that our bound on the number of cut 
nodes, cD/2d, is no more than the bound claimed in the statement of the theorem. 

Note that our weight assignment analysis works for any value of k, but that taking k = [2a\ gives the 
best ratio. An argument based on linear programming duality can be used to show that this proof cannot 
be improved: no other assignment of arc weights can give a better bound. □ 

3.1 Weighted Graphs 

Although our proof discussed only unweighted graphs, it clearly extends to weighted graphs as well. One 
way to see this is to redo the proof with a weighted tree packing, but this becomes notationally messy. A 
simpler way to extend the argument to graphs with integer edge weights is to replace a weight-w edge with w 
parallel unweighted edges, creating an unweighted graph with the same near-minimum cuts. This argument 
can be extended to graphs with rational edge weights as well: simply multiply all edge weights by their 
least common denominator, creating an integer-weighted graph with the same near minimum cuts. Finally, 
real valued edge weights can be handled by considering them as the limits of rational- valued edge weight 
sequences. 

3.2 Discussion 

Note that a cycle has exactly {^2a]) + ' ' ' + (2) minimum cuts. Thus for a a half integer, our upper bound 
is tight to within 1 -|- o(l). The gap widens when a is between two half-integers, reaching Q{n) when a is 
infinitesimally less than a half-integer. No family of graphs has been exhibited which has more a-minimum 
cuts than the cycle, so one is tempted to conjecture that the bound for the cycle is also the upper bound. 
The only counterexample of which we are aware is the 4-clique [NNI94], which has 10 4/3-minimum cuts 
(each singleton or pair of vertices) as compared to the 6 of a 4-cycle. But perhaps this is a unique exception. 

Another interesting open question regards the number of cuts strictly less than a times the minimum. 
For a a half integer, our above theorem only gives a bound of 0{n^^'^^ ); a bound of 0{n^^'^^ ), as exhibited 
by the cycle, seems more plausible. Such a bound has been proven for a = 3/2 [HW96]. 
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4 Algorithms for Finding Good Trees 



We now apply the ideas of the previous combinatorial sections algorithmically. We prove that in any graph, 

wc can find quickly a small set of trees such that the minimum cut 2-respects some of them. Therefore, we 
can find the minimum cut by enumerating only the cuts that 2-respect these few trees. 

4.1 Packing Algorithms 

Finding the trees is in a sense trivial from the preceding discussion, which showed that a maximum tree 
packing had the right property. Unfortunately, maximum tree packings in undirected graphs are hard to find. 
Gabow and Westermann [GW92] gave an algorithm for unweighted graphs that runs in 0(min(mn, m? / ^/n)) 
time. Barahona [Bar95] gives an algorithm with an 0{mn) running time for weighted graphs. Both of these 
running times are dramatically larger than the one we want to achieve, so we will use two other approaches 
that find non-maximum tree packings. 

One approach is due to Gabow. Although it does not yield a maximum packing, it yields a packing of value 
c/2 — sufficient for out purposes. Gabow's algorithm is actually an algorithm for directed graphs: in such a 
graph, it finds the c directed spanning trees guaranteed by Edmonds' theorem [Edm72] in 0{mc\og{'n? /m)) 
time. We can use this algorithm by turning each of our undirected graph's edges into two edges, one in 
each direction. This gives us a directed graph with minimum cut c. Gabow's algorithm finds a packing of c 
directed-edge disjoint trees in this graph. If we now ignore edge directions, wc get a set of c trees such that 
each of our undirected edges is in at most 2 trees. It follows that if we give each of our trees a weight of 1/2, 
we have a packing of value c/2 to which our previous arguments (Lemma 2.3) apply. 

An alternative approach is due to Plotkin, Shmoys, and Tardos [PST91]. They give an algorithm that 
packs spanning trees by repeatedly adding one weighted tree to the packing. Which tree to add and the 
weight it is given are determined by a minimum-cost spanning tree computation, using costs determined 
by the current packing. After 0(c/e^) iterations (which take time 0(mc/e^)), the packing has value at 
least (1 — e) times the maximum, and thus at least (1 — e)c/2. Although it is more complicated that 
Gabow's algorithm in that it works with fractional value, this method has the attraction of working only 
with undirected spanning trees, emphasizing that our algorithm is able to avoid any reliance on directed- 
graph algorithms. It will also be useful when we develop parallel algorithms. 

4.2 Sampling 

Unfortunately, both of the above algorithms have running times dependent on c, so using them directly is 
unacceptable. However, we now show use a random sampling step to reduce the effective minimum cut in 
the graph to O(logn), which means that both schemes can run in 0(m) time. 

Theorem 4.1. Given any weighted undirected graph G, in 0{m + nlog^n) time we can construct a set of 
O(logn) spanning trees such that the minimum cut 2-respects 1/3 of them with high probability. 

Proof. In a previous paper [Kar97a], we showed how to construct, in linear time for any e, a skeleton graph 
H on the same vertices with the following properties: 

• H has m' = O(ne^^logn) edges, 

• the minimum cut of H is c' = 0{e^^ logn), 

• the minimum cut in G corresponds (under the same vertex partition) to a (1 -h e)-times minimum cut 
of H. 

We already know that any tree packing of value c'/2 in the skeleton H will have many trees that 2- 
constrain the minimum cut in H . Intuitively, for small e, since the minimum cut in G is a near-minimum 
cut in H, the tree packing will also contain many trees that 2-constrain this near minimum cut. We now 
formalize this intuition. 

Set e = 1/6 in the skeleton construction. Since H has minimum cut c', Gabow's algorithm can be used to 
find a packing in H of weight c'/2 in 0{m'c' logn) — 0{n\og'^ n) time. The original minimum cut of G has 
at most (1 + e)c' edges in H, so by Lemma 2.3, a fraction |(1 — 2e) = 1/3 of the trees 2-constrain this cut 
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in H . But this (1 + g)-minimum cut of H has the same vertex partition as the minimum cut of G, implying 
that the same trees 2-constrain the minimum cut of G. □ 



Remark. The randomized construction of the skeleton in this proof is the only step of our minimum cut algo- 
rithm that involves randomization. A deterministic replacement of Theorem 4.1 would yield a deterministic 
minimum cut algorithm. 

An alternative construction based on the Plotkin-Shmoys-Tardos algorithm can also be applied to the 
sample. We can use that algorithm to find a tree packing of value (1 — 5)c' /2 in O(nlog^n) time for any 
constant 5. Lemma 2.3 again shows that for ^ = e = 1/6, say, a 1/10*'* fraction of the packed weight 
2-constrains the (1 + e)-minimum cut in H that corresponds to the minimum cut in G. Thus, if wc choose a 
random tree (with probability proportional to its weight), there is a constant probability that it 2-constrains 
the minimum cut. Performing O(logn) such random selections picks at least one tree that 2-constrains the 
minimum cut with high probability (In fact, since the algorithm only packs 0{{\ogn) / 5"^) distinct trees we 
can even afford to try them all). This gives another 0{m + nlog'^ n)-time algorithm for finding a good set 
of trees. It also has the following corollary that will be applied to our parallel minimum cut algorithm. 

Corollary 4.2. InTZAfC using m + nlogn processors and 0(log'' n) time, we can find a set ofO(logn) span- 
ning trees such that with high probability, a constant fraction of them (by weight) 2-constrain the minimum 
cut. 

Proof The skeleton construction can be performed as before (and is trivial to parallelize [Kar97a]). We have 
just argued that the algorithm of [PST91] can find an approximately maximum packing in the skeleton using 
0(log^ n) minimum spanning tree computations. Minimum spanning trees can be found in parallel using m' 
processors and O(logn) time [JM92], and the other operation of [PST91] are trivial to parallelize with the 
same time and processor bounds. Thus the claimed boimds follow. □ 

The remainder of this paper is devoted to the following question: given a tree, find the minimum cut 
that 2-respects it. Applying the solution to the O(Iogn) trees described by the previous lemma shows that 
we can find the minimum cut with high probability. This gives the following lemma: 

Lemma 4.3. Suppose the minimum cut that 2-respects a given tree can he found in T{m,n) time. Then the 
minimum cut of a graph can he found with constant probability in T{m, n) + 0(m + nlog^ n) time and with 
high probability in 0(T{m, n) log n + m-\-n log"^ n) time. 

Proof. We have seen above that a constant fraction (by weight) of the trees in our skeleton packing 2- 
constrain the minimum cut. So choosing a tree at random and analyzing it yields the minimum cut with 
constant probability. Trying all O(logn) trees identifies the minimum cut with high probability so long as 
the skeleton construction worked, which happens with high probability. □ 

We give two algorithms for analyzing a tree. The first algorithm (in Section 6) uses the fact that a tree 
contains only (j) pairs of edges, one of which defines the minimum cut 2-respecting the tree. A very simple 
dynamic programming step is used to compute the cut values defined by all (2) pairs of edges in O(n^) time. 
The second algorithm (in Section 7) aims for a linear-time bound, and must therefore avoid enumerating all 
pairs of edges. We describe local optimality conditions showing that only certain pairs of edges can possibly 
be the pair defining the minimum cut, and give an algorithm that runs quickly by only trying these plausible 
pairs. 

5 Minimum Cuts that 1-Respect a Tree 

To introduce our approach to analyzing a particular tree, we consider a simple special case: a tree that 
1-constrains the minimum cut. In such a tree there is one tree edge such that, if we remove it, the two 
resulting connected subtrees correspond to the two sides of the minimum cut. This section is devoted to a 
proof of the following: 

Lemma 5.1. The values of all cuts that 1-respect a given spanning tree can be determined in 0{m-\-n) time. 
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Corollary 5.2. The minimum cut that 1-respects a given spanning tree can be found in 0{m + n) time. 

We have already seen that being able to identify all cuts that 2-respect a tree will let us find the minimum 
cut. Later, we will exhibit a class of graphs for which finding the minimum 1-respecting cut will suffice for 
finding the minimum cut. 

We begin with some definitions. Suppose that we root the tree at an arbitrary vertex. 

Definition 5.3. is the set of vertices that are descendants of v in the rooted tree, including v. 
Definition 5.4. is the set of vertices that are ancestors of v in the rooted tree, including v. 
Note that fW^ = v. 

Definition 5.5. C{X, Y) is the total weight of edges crossing from vertex set X to vertex set Y. 

In particular, C{v,w) is the weight of the edge {v,w) if it exists, and otherwise. 

Definition 5.6. C{S) is the value of the cut whose one side is vertex set S, i.e. C{S, S). 

Once wc have rooted the tree, the cuts that 1-respect the tree have the form C{v^) for vertices v. Using 
this observation, we now prove Lemma 5.1. 

5.1 1-respecting a path 

As a first step, suppose the tree is in fact a path vi,. . . ,i;„ rooted at vi. We compute all values C(u|) 
in linear time using a dynamic program. First compute C{vi,vl) and C{vi,v'l) for each w^; this takes one 
0(m)-time traversal of the vertices' adjacency lists. We now claim that the following recurrence applies to 
the cut values: 

C{vi) = CiVn,vl) 

C{vj) = C{vl,)+C{vuvj)-C{vi,vl) 

This recurrence follows from the fact that when we move Vi from below the cut to above it, the edges from 
Vi to its descendants become cut edges while those from Vi to its ancestors stop being cut edges. It follows 
that all n cut values can be computed in 0{n) time by working up from Vn- 

5.2 1-respecting a tree 

We now extend our dynamic program from paths to general trees. 

Definition 5.7. Given a function / on the vertices of a tree, the treefix sum of /, denoted /-'-, is the function 

Lemma 5.8. Given the values of a function f at the tree nodes, all values of f^ can he computed in 0{n) 
time. 

Proof. Perform a postorder traversal of the nodes. When we visit a node v, we already will have computed 
(by induction) the values at each of its children. Adding these values takes time proportional to the degree 
of v; adding in f{v) gives us f^{v). Thus, the overall computation time is proportional to the sum of the 
node degrees, which is just the number of tree edges, namely n — 1. □ 

We now compute the values C{v^) via treefix sums. Let S{v) denote the (weighted) degree of vertex v. 
Let p{v) denote the total weight of edges whose endpoints' least common ancestor is v. 

Lemma 5.9. C{v^) = 5^{v) - 2p^{v). 
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Proof. The term S^{v) coimts all the edges leaving descendants of v. This correctly counts each edge crossing 
the cut defined hy v^, but also double-counts all edges with both endpoints descended from v. But an edge 
has both endpoints descended from v if and only if its least common ancestor is in . Thus the total weight 
of such edges is pJ^(v). □ 

Now note that the functions 5{v) and p{v) can both be computed for all v in 0(m) time. Computing 

5 is trivial. Computing p is trivial if we know the least common ancestor of each edge, but these can 
be determined in 0(m) time [GT85, BV93, SV88]. Prom these quantities, according to Lemma 5.8, two 
0(n)-time treefix computations suffice to determine the minimum cut. This proves Lemma 5.1. 

In Section 8, we will describe a class of graphs for which this "1-respects" test is sufficient to find the 
minimum cut. 

6 Minimum Cuts that 2-respect a Tree in O(n^) Time 

We now extend our dynamic program to find the minimum cut that 2-respects a given tree in O(n^) time. 
This will immediately yield an 0(n^ logn)-time algorithm for finding all minimum cuts. The dynamic 
program can be parallelized, yielding an n^-processor TZMC algorithm for the problem. The trace of the 
dynamic program also provides a small-space data structure representing all the near-minimum cuts. 

6.1 A sequential algorithm 

This section is devoted to proving the following: 

Lemma 6.1. The values of all cuts that 2-respect a given tree can he found in 0{n'^) time. 

A sequential algorithm is an immediate corollary (using Lemma 4.3): 

Theorem 6.2. There is a Monte Carlo algorithm that finds a minimum cut with constant probability in 
0{n'^) time and finds all minimum cuts with high probability in O(n^logn) time. 

We now prove the lemma. After rooting the given tree and applying the algorithm of Section 5 in case 
some tree 1-constrains the minimum cut, we can assume that (for the right tree) exactly 2 edges cross the 
minimum cut. These two edges are the parent edges of two vertices v and w. 

Definition 6.3. We say vertices v and w are incomparable^ writing t; _L w, \i v ^ w-^ and w ^ . In other 
words, V and w incomparable if they are not on the same root-leaf path. 

Suppose the minimum cut that 2-respects the tree cuts the parent edges of vertices v and w. If w _L w, 

then the parent edges of v and w define a cut with value C(w^ U w^). If (without loss of generality) v & , 
then their parent edges define a cut with value C(w^ — v^). We start with the first case. Assuming _L w, 

C{v^ U w^) = C{v^) + C{w^) - 2C{v^, w^) 

because C(?;^) andC(w'') each include the edges of C(w''^,w'') that should not be counted in C(i>''- Uw-'-). Since 
the values C{v^) are already computed for all v, we need only compute C{v^,w^) for every v and w. This 
can be done using treefix sums as in the previous section. First, let 

fy{w) = C{V,W) 

be the weight of the edge connecting v to w, or if there is no such edge. It follows that 

fl{w)=Civ,w^). 

Therefore, we can determine all values 

gw{v) = Civ,^)^) = f^{w) 
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in O(n^) time by performing n treefix computations, one for each At this point, since 



we can determine all the desired quantities with another n treefix sums, one for each g^. 
Now consider the second case ofvGw^. In this case, 

for subtracting C{v^) from C{w-^) correctly subtracts the edges connecting v-^ to w\ but also incorrectly 
subtracts all the edges running from to — v^. But we claim that 

C{v''^ , — v'^) = gjjj{v) — 2p'^{v). 

For gjj^ [v) counts every edge with one endpoint in and the other in , which includes everything we want 
to count but also incorrectly counts (twice) all edges with both endpoints in v^. This is cancelled by the 

subtraction of 2p^{v). 

Thus, once we have computed g^,, computing the cut value for each pair v G in 0{n^) time is trivial. 

6.2 A parallel algorithm 

We can parallelize the above algorithms. Note that they involve two main steps: finding a proper packing of 

trees and finding the minimum cut constrained by each tree. Recall from Corollary 4.2 that the algorithm 
of [PST91] can be used to find a satisfactory packing in 0(log^ n) time using m + nlogn processors. So we 
need only parallelize the above algorithms for a particular tree. But the only computations performed there 
are least common ancestor lookups, which can be performed optimally in parallel by, e.g., the algorithm 
of Schieber and Vishkin [SV88]; and treefix sum computations, which can also be performed optimally in 
parallel [KR90, Section 2.2.3]. 

Corollary 6.4. Minimum cuts can be found in TZMC with high probability in 0(log^ n) time using ri^/ log^ n 
processors for general graphs. 

6.3 Data Structures 

Previously, a linear-space representation of all minimum cuts was known [DKL76] , but the best representation 
of approximately minimum cuts for general a required G(n^") space [KS96]. Benczur [Ben95] gave an 0(n^)- 
space representation for a < 6/5. We can now do better. 

Theorem 6.5. Given a graph with k a-minimum cuts, a < 3/2, there is a data structure that represents 
them in 0{k + nlogn) = 0{n?) space that can be constructed in 0{n?\ogn) time or in TZMC using 
processors. The value of a near-minimum cut can be looked up in 0{n) time. 

Proof. The output of our 0(n^)-time algorithm can serve as a data structure for representing the near- 
minimum cuts that 2-respect the tree it analyzes. We simply list the pairs of edges that together induce a 
near minimum cut. Now consider any a < 3/2. We know that with high probability every a-minimum cut 
2-respects one of the O(logn) trees we inspect and will therefore be found. 

Notice that given a cut (specified by a vertex partition) we can use this data structure to decide if it is 
a-minimum, and if so determine its value, in 0(n log n) time: simply check, for each of the O(logn) trees in 
the data structure, whether at most 2 tree edges cross the cut. If the cut is a small cut, there will be some 
tree for which this is true. We will have recorded the value of the cut with the edge pair (or singleton) that 
crosses it. 

We can improve this query time to 0{n) if we use a perfect hash function to map each vertex set that 
defines a small cut to a size-O(fc) table that says which tree that particular small cut 2-respects. Given a 
query (vertex set) we can map that vertex set into the hash table and check the one tree indicated by the 
appropriate table entry. □ 
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In the sequential case, this data structure can actually be constructed dctcrniinistic;ally if wc arc willing to 
take some extra time. Simply perform the initial tree packing in the original graph rather than the skeleton. 
By Lemma 2.3, we know that any one minimum cut 2-respects at least half the trees of this packing. Thus, 
if wc pick a random tree, wc expect it to 2-constrain half the minimum cuts. It follows that some tree 
2-constrains half the minimum cuts. We can find such a tree deterministically via an exhaustive search. 
First, enumerate all the small cuts using, for example, the deterministic algorithm of [VY92]. Then, check 
each tree to find one that 2-constraints half the minimum cuts. In the case of an unweighted graph, wc must 
check c < m trees. In a weighted graph, we can start from a maximum packing with a polynomial number 
of distinct trees in it [Bar95] so that we can check them all in polynomial time. We now recursively find 
trees that 2-constrain the remaining minimum cuts that the first tree missed. Since we halve the number of 
remaining min-cuts each time, we only need log2 n^" trees. 

We can easily extend this approach to represent a-minimum cuts for any constant a. Simply observe 
that Theorem 4.1 generalizes to argue that of the O(logn) trees wc find there, a constant fraction will [2a\- 
respect any given a-minimum cut with high probability. We can therefore represent all a-minimum cuts by 
considering all sets of [2aJ edges in each tree. For any one tree, the values induced by all sets of [2aJ edges 
can be found in 0(nl-^"-l) time using treefix computations as above. 

Lemma 6.6. There is a data structure that represents all k of the a-minimum cuts in a graph using 
0{n log n+ka) = 0{an^'^°'^ ) space. Its randomized construction requires 0(n'-^"J ) time and space or 0{n^'^°'^ ) 
processors in TZMC. It can also be constructed deterministically in polynomial time. 

7 Near-Linear Time 

We now reconsider the algorithm of Section 6 and show how its objective can be achieved in 0(m log^ n) 
time on any given tree. In this section we prove the following lemma. 

Lemma 7.1. The minimum cut that 2-respects a tree can be found in O(mlog^n) time. 

Applying Lemma 4.3 yields our main theorem: 

Theorem 7.2. The minimum cut of a graph can be found with constant probability in 0{m log^ n + n log^ n) 
time or with high probability in O(mlog^n) time. 

To prove the lemma, as in Section 6, wc first consider finding the pair minimizing C{v^ U w^) for u _L w; 
the almost identical case of minimizing C{w^ — v-^) for € w-'- is deferred to the end of the section. Rather 
than explicitly computing C{v^ U w^) for all pairs v and w, which clearly takes 6(n^) time, we settle for 
finding for each v a vertex w that minimizes C{v^ l^w^). This will clearly let us identify the minimum cut. 

7.1 Precuts 

Recall that for any vertices w _L u, 

C{yi U w^) = C{v^) + C{w^) - 2C{v^,w^). 
We now factor out the contribution of C{v^) to this quantity: 
Definition 7.3. The v-precut at w, denoted Cy{w), is the value 

C„(w) = C{v^ U w^) - C{v^) = C{w^) - 2C{v^, w^) 

if w -Lv and oo otherwise. 

Definition 7.4. The minimum v-precut, denoted C„, is the value 

min{Cv{w) \ 3{v',w') G E, v' G , w' G w^} 

The point of this somewhat odd definition is to notice that for a given v, only certain vertices w are 
candidates for forming the minimum cut with v, as is seen in the following lemma. 
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Lemma 7.5. If it is defined by incomparable vertices, the minimum cut is mmy{C{v^) +Cy). 

Proof. Let the minimum cut be induced by v and a vertex w J- v. It suffices to show that w is in the set 
over which we minimize to define C„ in Definition 7.4. But this follows from the fact that each side of the 
minimum cut induces a connected subgraph of G, for this implies that there must be an edge {v' , w') from 
to . To see that each side of the minimum cut must be connected, note that if one side of the minimum 
cut has two disconnected components then we can reduce the cut value by moving one of these components 
to the other side of the cut. □ 

Suppose that we have already used the linear-time procedure of Section 5 to compute the values C{v'^) 
for every v. It follows from Lemma 7.5 that it suffices to compute Cy for every vertex v: the minimum cut 
can then be found in 0{n) time. We now show how these minimum precuts can be computed in 0(m log^ n) 
time on any tree. We first show that we can find Cy for a leaf v in time proportional to its degree. We then 
extend this approach to handle a bough — a path rising from a leaf until it reaches a vertex with more than 
one child. Finally, we handle general trees by repeatedly pruning boughs. 

We begin by giving each vertex w a variable val[u;]. While working on vertex v, this variable will be 
used to accumulate Cy{w). Initially, we set val[w] = C{w^). 

7.2 A Leaf 

First consider a particular leaf v. To compute C„, consider the following procedure: 

1. For each vertex w, subtract '2C{v,w^) from val[w] (so that val[w] = Cy{w)). 

2. is the minimum of val[t«] over all w _L that are ancestors of a neighbor of v. 

The correctness of this procedure follows from Lemma 7.5, the initialization of val[w], and the definition of 

To implement this procedure efficiently, we use the dynamic tree data structure of Sleator and Tar- 
jan [ST83]. Given a tree, this structure supports (among others) the following operations on a node v: 

AddPath(w,a;): add x to val[u] for every u e . 

MinPath(v): return min val[w] as well as the u achieving this minimum. 

Each such dynamic tree operation takes O(logn) amortized time steps. 

We use dynamic trees to compute C„. Recall that C{v, u) is the weight of the edge connecting vertices v 
and u, or if no such edge exists. Apply procedure LocalUpdate from Figure 7.2. 



Algorithm LocalUpdate (?;) 

1. Can AddPath(w,oo). 

2. For each edge ((;,«,), call AddPath(ii, — 2C(t', it)). 

3. For each edge (u, u), call MinPath(M) 

4. Return the minimum result of Step 3. 



Figure 1: procedure LocalUpdate 
Lemma 7.6. After Step 2 in a call to LocalUpdate(w), every w has valfw] = Cy{w). 

Proof. Step 1 assigns an infinite val[!i;] to every ancestor w of v, as required in the definition of Cv{w). Next, 
2C{v, u) is subtracted in Step 2 from every val[u'] such that u & . Therefore, the total amount subtracted 
from val[w] is 2C{v,w^), as required in Definition 7.3. It follows that after Step 2, val[w] = Cy{w). □ 
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Lemma 7.7. LocalUpdate(u) called on a leaf v returns Cy. 

Proof. Step 3 minimizes over w ^- v that are ancestors of neighbors of v. According to Definition 7.4 this 
will identify C^. □ 

It follows that for a leaf v with d incident edges, we can find C„ via 0{d) dynamic tree operations that 
require 0{d\ogn) time. 

7.3 A Bough 

Wc generalize the above approach. Let a hough be a maximal path starting at a leaf and traveling upwards 
until it reaches a vertex with more than one child (this vertex is not in the bough) . 

Lemma 7.8. Let v he a vertex with a unique child u. Then either Cy = C„, or else Cy = Cy{w) for some 
ancestor w of a neighbor of vertex v. 

In other words, given the value C„, we need only "check" ancestors of neighbors of v to determine Cy. 

Proof. We know that Cy = Cy{w) for a certain w. Suppose w is not an ancestor of a neighbor of v. Then 

there is no edge from v to w^, meaning C{v^,w^) = C{uK w^). It follows that C„ < Cu{w) = C„{w) = Cy. 
But u E implies that for any x, C{u^,x-^) < C{v^,x-^), so wc know that Cu > Cy. Therefore Cy =Cv. O 

We use the above lemma in an algorithm for processing an entire bough. Given a bough with d edges 
incident in total on its vertices, we show how to process all vertices in the bough in O(dlogn) time. We use 
the recursive procedure MinPrecut from Figure 7.3. This procedure is initially called on the topmost vertex 
in the bough. Although we have formulated the procedure recursively to ease the proof exposition, it may 
be more natural to unroll the recursion and think of the algorithm as executing up from a leaf. 



Algorithm MinPrecut(i') 

if V is the leaf of the bough 

call LocalUpdate(w) (also computes Cy) 

else 

Let u be the child of v 

c\ •*— MinPrecut(u) (updates some val[-] entries) 
C2 ^ LocalUpdate(w) 
return min(ci,C2) 



Figure 2: procedure MinPrecut 

We claim that MinPrecut computes the desired quantity. To prove this, we prove a stronger statement 
by induction: 

Lemma 7.9. A call to MinPrecut(w) 

1. sets valfw] = Cy{'w) for each w, and 

2. returns Cy . 

Proof. By induction. Wc first prove Claim 1. The base case is for v a leaf, and follows from our analysis 
of LocalUpdate for a leaf in Lemmas 7.6 and 7.7. Now suppose v has child u. After calling MinPrecut(u), 
we know by induction that val[w] = C„(w). Thus, after we execute LocalUpdate(t;), which (as shown in 
Lemma 7.6) decreases each val[t«] by C{v,w^), entry val[w] is updated to be 

Cu{w)-2C{v,w^) = (C(w^) - 2C(u^«;^)) - 2C(u,u;^) 

= C{w^) -2C(v\Ju^,w^) 

= C(w^) - 2C{v^ ,w^) 

= Cy{w) 
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Wo now prove Claim 2. Suppose v has ehild u. According to Lemma 7.8, there arc two possibihties for 
Cy. One possibihty is that Cy = C„. But by induction, the recursive call to MinPrecut(M) sets Ci = C„. On 
the other hand, if Cy ^ C„, then by Lemma 7.8, Cy = Cy{w) for an ancestor of a neighbor of v. But then C2 
is set to Cy when we call LocalUpdate(v). In either case, the correct value is min(ci, C2). □ 

Based on this lemma, the correctness of the algorithm for finding all minimum precut values in a bough 
is immediate. 

7.4 A Tree 

We now use the bough algorithm repeatedly to analyze a particular tree. We still assume for now that the 
cut is defined by w _L w; we consider the other case in the next subsection. Wc have already argued that we 
can perform the computation for a single bough using a number of dynamic tree operations proportional to 
the number of edges incident on the bough. Doing so will change the values val[t(;]. However, we can return 
them to their original values by repeating the execution of MinPrecut on the bough, replacing each addition 
operation by a subtraction. 

It follows that we can process a collection of boughs in sequence since each computation on a bough 
starts with the val[w7] variables back at their initial values C{w^). Now note that every edge of G is incident 
on at most 2 boughs. It follows that the total size (number of incident edges) of all boughs of our tree is 
0{m). Therefore, the time to process all boughs of the tree with the algorithm of the previous section is 
0(m log n). 

Once we have processed all boughs, we know that we have found the minimum cut if even one of the two 
tree-edges it cuts is in a bough. If not, we can contract each bough of the tree (that is, merge all vertices in 
a bough into the vertex from which that bough descends) without affecting the minimum cut. This can be 
done in 0(m) time (using a bucket sort). It leaves us with a new tree that 2-constrains the minimum cut of 
the contracted graph. Furthermore, every leaf of the new tree had at least two descendants in the original 
tree (else it would have been in a bough). This means that the new tree has at most half as many leaves 
as the old. It follows that after 0(log n) iterations of the procedure for boughs followed by contractions of 
boughs, we will have reduced the number of tree- leaves to 0, meaning that we will have contracted the entire 
graph. Clearly at some time before this, we found the minimum cut. 

7.5 Comparable v and w 

It remains to explain the procedure for finding Cy{w) when (without loss of generality) w G w^. We proceed 
much as before. In Section 6, we showed that 

C{w^ - v^) = C{w^) - C{v^) + 2(gi,(v) - 2p^{v)). 

It is therefore sufficient to compute, for each v, after factoring out C{v^) + Ap^{v), the quantity 

Cy{w) = m:mC{w^) + 2g}^{v). 

We again do so using the dynamic tree data structure. We initialize val[w] = C{w^). We then use dynamic 
tree operations to add 2g^{v) to val[w]. Suppose first that u is a leaf. Then for each neighbor u of v, we 
call AddPath(M, 2C(w, u)). Having done so, we can extract our answer as MinPath(t)). This follows because 
gl,{;v) = "^^^yi C{x, w^). So to add 2gl^{v) to w, we want to add twice the capacity of every edge from v to 
each descendant u G w^. 

Also as before, we can process an entire bough by working up from the leaf. For each vertex v' in the 
bough, we execute kddPa.t'h{u' ,C{v' , u')) operations for each neighbor u' of v' and then compute MinPath(u'). 
By the time we call MinPath(w'), we will have val[w] properly updated for each ancestor w of v' . After 
processing each of the boughs, we contract all of them and recurse on the resulting tree. Note that unlike 
the case of w _L u, here we perform only one MinPath(w') operation for each vertex v' in the bough, rather 
than performing one for each of its neighbors. 
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8 A Simple Class of Graphs 



In this section, we describe a class of graphs that are easier to solve. On these graphs we can find the 

minimum cut using only our simple algorithm for cuts that 1-respcct a tree. Since wc can recognize these 
graphs quickly, in practice we might often be able to avoid running the more complicated algorithm for the 
2-respecting case. 

The Nash- Williams lower bound of c/2 trees in a graph with minimum cut c is often pessimistic. The 
number of trees in the maximum packing may be as large as c (the exact number is determined by the value 
of the "sparsest cut" of the graph as discussed in Section 2 — see [NW61] for details). Call G a 5-fat graph 
if its maximum packing contains more than (1 + S)c/2 trees. 

Theorem 8.1. For any constant 6 > Q, the minimum cut in a 5-fat graph can he found with constant 
probability in 0{m + nlog'' n) time and thus with high probability in 0(m log n + nlog'^ n) time. 

Proof. Consider a maximum packing in a ^-fat graph. It has (1 + 8)c/2 trees (by weight) sharing c minimum 
cut edges. Thus the average number of minimum cut edges per tree is at most 2/(1 + 5) < 2. Thus, by 
Markov's inequality, there is a constant probability that the number of min-cut edges in a random tree is 
less than 2, meaning that it must be exactly 1. 

As before, wc cannot actually take the time to find a maximum packing. However, wc can use the same 
skeleton construction as Theorem 4.1. We have shown [Kar93b] that if a graph is (5-fat then, with high 
probability, any skeleton we construct from it is [6 — e)-fat. Consider the skeleton H with minimum cut 
c' = O(logn). Since it is (5-fat, the maximum packing has [1 + 5 — e)c'/2 trees. Wc can therefore find a 
packing of (1 -|- 5/2 — e)c'/2 trees in 0(n log^ n) time using the Plotkin-Shmoys-Tardos algorithm (note that 
Gabow's algorithm cannot be applied here as it always gives a packing of weight exactly c'/2 regardless of 
the graph's fatness). These trees share the at most (1 + e)c' edges of the cut in H corresponding to the 
minimum cut of G. Thus, so long as e <C 5 (e = 5/5 suffices), the average number of cut edges per tree is a 
constant less than 2. Thus, as in Lemma 2.3, a constant fraction of the trees (by weight) will 1-respect the 
minimum cut of G. 

Given such a tree, we can use the algorithm of Section 5 to identify the minimum cut in linear time. 
The constant-probability argument follows by selecting a random tree and running on it our linear-time 
algorithm for finding the minimum cut that 1-respects it. The high probability bound follows from selecting 
O(logn) random trees and running our algorithm on each. On sparse graphs, the bottleneck is now the 
packing algorithm of [PST91] . □ 

The above algorithm can be applied even if we do not know initially that our graph is e-fat, for the 
heuristic is actually "self certifying." Once we have used the tree to find the minimum cut in the graph, 
we can compare the size of our tree-packing to the value c" of the corresponding cut in the skeleton (which 
we know is near-minimum with high probability). If the packing has (1 -I- e)c"/2 trees, then we know the 
skeleton is e-fat. By a Theorem of [Kar93b], with high probability the; skcileton is r-fat if and only if the 
original graph is. Unfortunately, the certification is correct with high probability but not with certainty 
(there is a small chance the skeleton is e-fat but the graph is not). Thus, the heuristic does not guarantee 
that we have found the minimum cut. Rather, we have the following: 

Lemma 8.2. // a graph is e-fat, then with high probability the e-fat heuristic certifies itself and returns the 
minim,um cut. If a graph is not e-fat, then with high probability that e-fat heuristic does not certify itself. 

Thus, so long as we are satisfied with a Monte Carlo algorithm, we can halt if the algorithm certifies 
itself. 

Although fat graphs seem to form a natural class, it should be noted that they form a small class: a 
random graph (in the standard model of BoUobas [Bol85]) is highly unlikely to be fat. Its minimum cut is 
within 1 + o(l) of the average degree k, and counting edges shows that a graph with average degree k can 
have at most k/2 spanning trees. 

9 Small Refinements 

In this section, we give two very small refinements of our algorithm that improve its running time to 
the unhappy bound of 0(m(log^ n) log(n^/m)/ loglogn). Such a messy bound suggests to us that further 
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improvements should be possible. 



9.1 Fewer trees 

Our first improvement reduces the number of trees that we must analyze for 2-constrained cuts. 

Lemma 9.1. A minimum cut can be found with high probability in 0{m log^ nj log log n + n log® n) time. 

Proof. We use the fact that trees can be analyzed more quickly for 1-respecting cuts than for 2-respecting 
cuts. In our initial sampling step (discussed in Section 4), set e = 1/4 logn. Using Gabow's algorithm to 
pack trees in this sample takes 0(nlog''n) time. In [Kar97a] we give a speedup of Gabow's algorithm that 
improves the running time to 0(n log® n). 

In a second phase, choose 41og^n trees at random from the packing and find the minimum cut that 1- 
respects each in O(mlog^n) time. In a third phase, choose logn/loglogn trees at random from the packing 
and find the minimum cut that 2-respects each of them in 0(m log^ n/ log logn) time. Our speedup arises 
from checking logn/ log logn trees for the "2-respects" case instead of logn. We claim that this algorithm 
finds the minimum cut with probability 1 — 0(l/n). 

To see this, consider the sampled graph. It has minimum cut c'. The minimum cut of G corresponds to 
a cut of value at most (1 + e)c' in the sample. The tree packing has value p > c' /2, of which say ap trees 
1-respect the minimum cut and (3p trees 2-resp(;ct the minimum cut. Note that since any tree which is not 
counted by a or /3 must have at least 3 cut edges, we have 

ap + 2/3p + 3(l-a-/3)p < (1 + e)c' < 2(1 + e)p 

a + 2P + S{l-a- f3) < 2(1 + e) 

3-2a- P < 2 + 2e 

13 > 1 - 2e - 2a 



Suppose first that a > 1/4 log n. Then each time we choose a random tree, the probability is a that we 
will choose one that 1-respects the minimum cut. Thus if we choose 41og^ n random trees, the probability 
that none of them 1-respects the cut is at most 

(l-l/41ogn)*'°s'"<l/n. 

Thus, with high probability in the second phase we choose and analyze a tree that 1-respects the minimum 
cut, and therefore find the minimum cut. 

Now suppose that a < 1/4 logn. It follows from the previous inequalities and the fact that e = 1/4 logn 
that 

13 > 1-1/ logn 



It follows that when we choose logn/ log logn trees and test them for 2-respecting cuts in the third phase, 
our probability of failing to choose a tree that 2-respects the minimum cut is only 



(I _ ^yog ra/ log log n ^ 



-j^ \ log n/ log log n 



logn 
= 1/n 

Thus, regardless which case holds, we find the minimum cut with probability at least 1 — 1 /n. 

□ 
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9.2 Solving one tree faster 

Our second improvement reduces the time it takes to analyze a single tree. 

Lemma 9.2. The smallest cut that 2-respects a given tree can be found in 0(m log n log (n^/m)) time. 

Proof. Wc previously bounded the time to analyze a bough with d incident edges by 0{dlogn). We show 
that in an amortized sense, we can also bound this time by O(nlogn). We thus break the time to process 
a tree with up to n leaves into two parts. The time to reduce the number of leaves (and boughs) to I is (as 
was shown in Section 7) 0(TO(logn) logn/l). The time to process the Z-leaf tree is (by the amortized bound) 
0(n/ log n). Overall, the time to process the tree is 

0{{m log n/l + nl) log n). 

Choosing I = m/n yields the claimed time bound. 

To prove the amortized time bound, suppose we process a bough with d incident edges. Afterward, we 
contract that bough to a single node. Assuming we merge parallel edges diiring the contraction, a single 
node can only have n incident edges. Thus, d — n oi the edges incident on the bough vanish. We charge 
the dynamic tree operation for each edge that vanishes to the vanishing edge, and charge the at-most n 
dynamic tree operation for non-vanishing edges to the algorithm. Over the entire course of the algorithm, 
each edge gets charged once (for a negligible total of 0{m) operations) while the algorithm gets charged 
0{nl) times. □ 

10 Conclusion 

This paper has presented two algorithms for the minimum cut problem. The first is quite simple and runs 

in 0{n^ logn) time (and on fat graphs in 0(m log n) time). The second algorithm is relatively complicated, 
but holds out the possibility that the minimum cut problem can be solved in linear time. Several probably 
unnecessary logarithmic factors remain in the running time, suggesting the following improvements: 

• Give a dynamic path-minimization data structure taking constant amortized time per operation. This 

would reduce the running time by a logn factor. We are not using the full power of dynamic trees (in 
particular, the tree we are operating on is static, and the sequence of operations is known in advance), 
so this might be possible. 

• Extend the approach for boughs directly to trees in order to avoid the O(logn) different phases needed 
for pruning boughs. This too would reduce the running time by a logn factor. 

• Give a deterministic linear-time algorithm for finding a tree that 2-constrains the minimum cut. This 
would eliminate the logn factor required by our randomized approach. Note that randomization is 
used only to find the right tree; all remaining computation is deterministic. Thus any o(mn)-time 
algorithm for finding a good tree would yield the fastest known deterministic algorithm for minimum 
cuts. 

Our algorithm is Monte Carlo. It would be nice to develop a deterministic, or at least a Las Vegas, 
algorithm based upon our ideas. One standard approach to getting a Las Vegas algorithm would be to 
develop a minimum cut "certifier" that would check the correctness of our Monte Carlo algorithm. 

Another question is whether the near-linear time algorithm can be parallelized. This would require 
finding a substitute for or parallelization of the sequence of dynamic updates the algorithm performs. 

Our tree-packing approach has also led to some progress on the enumeration of near- minimum cuts. The 
gap between upper and lower bounds is now extremely small and should be eliminated entirely. Except for 
one graph on 7 vertices [NNI94], the cycle appears to have the most small cuts, so it is the upper bound 
which seems likely to be improved. 
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A Optimality of the Cut Counting Proof 



In Theorem 3.3, we gave an upper bound of 



k + l-2a 



on the number of a-minimum cuts a graph may have. The upper bound held for every (and thus the smaUest) 
vahic of fc > [2q;J (to ensure > 0). The proof of this bound relied on a weighted bipartite graph on cuts 
and packed trees. It is not obvious that this proof has been "optimized." Perhaps a different assignment of 
edge weights would yield a better bound. In this section, we show that no better bound is possible. Our 
analysis was derived using linear-programming duality, but is presented in a simplified way that does not 
use duality. 

Our upper-bound proof can be thought of as a kind of game. We choose to assign a weight > to 
any edge connecting a cut C to a packed tree containing exactly r edges of C. The maximum weighted tree 
degree is therefore ^ Wr (") . We then consider the smallest possible weighted degree that a cut C could 
have given these weights. If rir trees contain exactly r edges of this minimum-degree cut, then its weighted 
degree is J^'^r^r- We deduce that the number of cuts is upper bounded by 

R{{Wr,nr}) = 

2^ WrUr 

The bound only holds if Ur defines the smallest possible cut degree, so our choice of weights Wr yields an 
upper bound of 

maxi?({u;r-, rir}) 

rir 

on the number of cuts. Since we are interested in as tight a bound as possible, we would actually like to 
compute 

min inaxR{{wr, rir}). 

Wr Ur 

Of course, we need not consider all possible values rir- In particular, since every tree has r cut edges for 
some r, we know that 

^nr= c/2. 

r 

Furthermore, since the trees must share the at most ac cut edges, we must have 

r ■ rir < etc. 

r 

Finally, we must clearly have 

rir > 

for every r. Conversely, for any rir satisfying these three equations, there clearly exist trees yielding these 
values rir- 

Using these restrictions on rir, we used a particular set of values Wr to show in Theorem 3.3 that 

mmmax.R{{'Wr,nr}) < min qk- 

Wr rir k> [2aJ 

We now prove that one cannot make a stronger argument. We exhibit a particular assignment of values rir 
for which 

min R(\wr, rir}) = min Qk- 

Wr- fc>L2aJ 
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Thus, no weight assignment ean prove a ratio better than mingfe. Since we have given weights that prove a 
ratio of ming^, we have an optimal bound- 
To show our analysis is optimal, fix k to minimize qk and consider the following values for n^: 




(r > fc + 1) 



We will shortly show that these Ur satisfy the three feasibility equations and thus correspond to some set of 
trees. We will also prove one more fact: for every r. 

This clearly holds with equality for r < A: + 1 and is trivial for r > fc + 1; we will show it holds for r = fc + 1 
as well. Assuming it for now, observe that for any weights Wr used in the proof, the ratio we get for our 
proof is 



R{Wr,nr)- 2^^'-^ 



> 



^ rirWr 
2 Sr</s (r) 
Sr<fe 2qi^ (r) 

qk- 



In other words, no weight assignment can yield a better bound than in the proof of Theorem 3.3. 

We now need to justify the claims made earlier in Equations 2, 3, 4, and 5. Equation 2 is obvious from 
the definition of nk+i- Next we prove Equation 3, showing that '^r ■ rir < ac. Simply note 




= |(fc + l- (fc + l-2a)) 
= ac 

as claimed. 

It remains only to prove Equations 4 and 5, which are non-obvious only for r = A; + 1. Plugging in the 
definition of rifc+i, we need to prove that 



r<k 

or equivalently (multiplying by 2qk/c) that 



°"2^^ qk^Xr)^-2qk\k + l 



r<k ^ ^ r<k+l ^ 



21 



We begin with the upper bound. Since qk < Qk+i (by choice of k to minimize qk), we deduce that 

qk = {k + 2- 2a)qk -{k + 1- 2a)qk 
<{k + 2- 2a)qk+i -{k + 1- 2a)qk 

r<k+l ^ ^ r<k ^ 



= E 

r<k+l 



as desired. Now consider the lower bound. If fc > [2a\, then we must have qk > qk-i, so we can work much 
as before: 

qk = {k + I - 2a)qk - {k - 2a)qk 
> (fc + 1 - 2a)qk+i - (fc + 2a)qk 



r<k ^ ' r<fc-l ^ 

=E(: 



r 

r<.k 

If, on the other hand fc = [2a\ , then just recaU that 

fc + 1 — r /n 



* = E 



k + l-2a\r 

r<k 

But note that for all r < k, the numerator fc + 1 — r > 1 while the denominator fc + 1 — 2a < 1. So the claim 
holds. 

This completes the analysis of our proof, and shows that now other weight function assignment will 
improve on the one wc have already given. Of course, this does not prove that there is no better bound on 
the number of cuts: a completely different proof could show one. 
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